NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Structured and Balanced Multicomponent and Multilayer Neural Networks

https://doi.org/10.1137/24M1675990

Zhang, Shijun; Zhao, Hongkai; Zhong, Yimin; Zhou, Haomin (October 2025, SIAM Journal on Scientific Computing)

In this work, we propose a balanced multicomponent and multilayer neural network (MMNN) structure to accurately and efficiently approximate functions with complex features in terms of both degrees of freedom and computational cost. The main idea is inspired by a multicomponent approach in which each component can be effectively approximated by a single-layer network, combined with a multilayer decomposition strategy to capture the complexity of the target function. Although MMNNs can be viewed as a simple modification of fully connected neural networks (FCNNs) or multilayer perceptrons (MLPs) by introducing balanced multicomponent structures, they achieve a significant reduction in training parameters, a much more efficient training process, and improved accuracy compared to FCNNs or MLPs. Extensive numerical experiments demonstrate the effectiveness of MMNNs in approximating highly oscillatory functions and their ability to automatically adapt to localized features. Our code and implementations are available at GitHub.
more » « less
Free, publicly-accessible full text available October 31, 2026
Why shallow networks struggle to approximate and learn high frequencies

https://doi.org/10.1093/imaiai/iaaf022

Zhang, Shijun; Zhao, Hongkai; Zhong, Yimin; Zhou, Haomin (July 2025, Information and Inference: A Journal of the IMA)

Abstract In this work, we present a comprehensive study combining mathematical and computational analysis to explain why a two-layer neural network struggles to handle high frequencies in both approximation and learning, especially when machine precision, numerical noise and computational cost are significant factors in practice. Specifically, we investigate the following fundamental computational issues: (1) the minimal numerical error achievable under finite precision, (2) the computational cost required to attain a given accuracy and (3) the stability of the method with respect to perturbations. The core of our analysis lies in the conditioning of the representation and its learning dynamics. Explicit answers to these questions are provided, along with supporting numerical evidence.
more » « less
How much can one learn from a single solution of a PDE?

Zhao, Hongkai; Zhong, Yimin (November 2023, Pure and applied functional analysis)

Full Text Available
On Enhancing Expressive Power via Compositions of Single Fixed-Size ReLU Network

Zhang, Shijun; Lu, Jianfeng; Zhao, Hongkai (October 2023, Proceedings of the 40th International Conference on Machine Learning)

Full Text Available
How Much Can One Learn a Partial Differential Equation from Its Solution?

https://doi.org/10.1007/s10208-023-09620-z

He, Yuchen; Zhao, Hongkai; Zhong, Yimin (October 2023, Foundations of Computational Mathematics)

Full Text Available
A data-driven and model-based accelerated Hamiltonian Monte Carlo method for Bayesian elliptic inverse problems

https://doi.org/10.1007/s11222-023-10262-y

Li, Sijing; Zhang, Cheng; Zhang, Zhiwen; Zhao, Hongkai (August 2023, Statistics and Computing)

Full Text Available
Quantitative PAT with simplified P _N approximation

https://doi.org/10.1088/1361-6420/abf318

Zhao, Hongkai; Zhong, Yimin (April 2021, Inverse Problems)
null (Ed.)
Full Text Available
A Dual Iterative Refinement Method for Non-rigid Shape Matching

https://doi.org/10.1109/CVPR46437.2021.01567

Xiang, Rui; Lai, Rongjie; Zhao, Hongkai (June 2021, Conference on Computer Vision and Pattern Recognition)

Full Text Available
Marchenko–Pastur law with relaxed independence conditions

https://doi.org/10.1142/s2010326321500404

Bryson, Jennifer; Vershynin, Roman; Zhao, Hongkai (January 2021, Random Matrices: Theory and Applications)
null (Ed.)
We prove the Marchenko–Pastur law for the eigenvalues of [Formula: see text] sample covariance matrices in two new situations where the data does not have independent coordinates. In the first scenario — the block-independent model — the [Formula: see text] coordinates of the data are partitioned into blocks in such a way that the entries in different blocks are independent, but the entries from the same block may be dependent. In the second scenario — the random tensor model — the data is the homogeneous random tensor of order [Formula: see text], i.e. the coordinates of the data are all [Formula: see text] different products of [Formula: see text] variables chosen from a set of [Formula: see text] independent random variables. We show that Marchenko–Pastur law holds for the block-independent model as long as the size of the largest block is [Formula: see text], and for the random tensor model as long as [Formula: see text]. Our main technical tools are new concentration inequalities for quadratic forms in random variables with block-independent coordinates, and for random tensors.
more » « less
Full Text Available
Efficient and Robust Shape Correspondence via Sparsity-Enforced Quadratic Assignment

https://doi.org/10.1109/CVPR42600.2020.00953

Xiang, Rui; Lai, Rongjie; Zhao, Hongkai (June 2020, CVF Conference on Computer Vision and Pattern Recognition (CVPR))

Full Text Available

« Prev Next »

Search for: All records